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A distributed average consensus algorithm in which every sensor transmits with bounded peak 
■ power is proposed. In the presence of communication noise, it is shown that the nodes reach consensus 

' asymptotically to a finite random variable whose expectation is the desired sample average of the 

^ \ initial observations with a variance that depends on the step size of the algorithm and the variance 

of the communication noise. The asymptotic performance is characterized by deriving the asymptotic 
covariance matrix using results from stochastic approximation theory. It is shown that using bounded 
transmissions results in slower convergence compared to the linear consensus algorithm based on the 
CO ' Laplacian heuristic. Simulations corroborate our analytical findings. 
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I. INTRODUCTION 

Wireless sensor networks (WSNs) without a fusion center have the advantages of robustness to 
node failures and they can function autonomously without a central node controlling the entire 
network [1]. In such fully distributed networks, sensors collaborate with their neighbours by 
repeatedly exchanging information locally to achieve a desired global objective. For example, 
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the sensors could come to an agreement on the sample average (or on a global function) of 
initial measurements. This is called distributed consensus. Distributed consensus algorithms 
have attracted significant interest in the recent past and have found several applications in areas 
such as healthcare, environmental monitoring, military and home appliances (please see [2]-[8] 
and references therein). In this body of literature, it is often assumed that a given node can 
obtain exact information of the state values of its neighbours through local communications. 
This essentially means that the system consumes theoretically unlimited energy and bandwidth. 
However, practical WSNs are severely power limited and the available bandwidth is finite. 
Moreover, the main source of power consumption in a sensor is its transceiver [9]. Therefore, 
there is a need for consensus algorithms which work under strict resource constraints of power 
and bandwidth imposed by the WSNs. 

Sensors may adopt either a digital or analog method for transmitting their information to 
their neighbours. Digital methods of transmissions may be using low transmit power but require 
increased bandwidth especially when the number of quantization levels is high. Distributed 
consensus algorithms using quantized transmissions have been studied in [10]-[14]. The analog 
method consists of transmitting unquantized data by appropriately pulse shaping and amplitude 
or phase modulating to consume finite bandwidth. One such method is the amplify-and-forward 
(AF) scheme in which sensors send scaled versions of their measurements to their neighbours. 
However, using the AF technique is not a viable option for WSNs because it requires high 
transmission power when the values to be transmitted are large [15]. Moreover, the linear transmit 
amplifier characteristics required for AF are often very power-inefficient [16], requiring the 
study of the effect of non-linear transmissions on performance. In distributed systems which 
employ the AF technique for transmission of the sensed data, it is often assumed that the power 
amplifiers used are perfectly linear over the entire range of the sensed observations. In practice, 
the amplifiers exhibit non-linear behaviour when the amplitude of the sensed data is relatively 
high [16]-[18]. 

In this paper, we propose a non-linear distributed consensus (NLC) algorithm in which 
every sensor maps its state value through a bounded function before transmission to constrain 
the peak transmit power. Therefore the magnitude of the transmitted signal at every node 
in every iteration is always bounded, making it ideal for resource-constrained WSNs. In the 
presence of communication noise, we prove that all the sensors employing the NLC algorithm 
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reach consensus to a finite random variable whose mean is the desired sample average. We 
characterize the asymptotic performance by deriving the asymptotic covariance matrix using 
results from stochastic approximation theory. We show that using the NLC algorithm results in 
larger asymptotic covariance compared to the linear consensus algorithm. Finally we explore 
the performance of the proposed algorithm employing various bounded transmission functions. 
Different from [8] which also considered consensus in the presence of noisy transmissions, 
herein we analyse non-linear transmissions and study the asymptotic covariance matrix and its 
dependence on the non-linearity. Our work in this paper also studies the merits and demerits of 
distributed schemes involving realistic amplifier models with non-linear characteristics such as 
the ones discussed in [16], [17]. 

The rest of this paper is organized as follows. We begin by reviewing some basics of network 
graph theory in Section II. In Section EI, we describe the system model and review the previous 
work on non-linear consensus. We consider the NLC algorithm in the presence of noise in Section 
rv, and prove that the sensors reach consensus to a random variable. In Section V, we present 
several simulation examples to study the performance of the proposed algorithm. Concluding 
remarks are presented in Section VI. 

Notations and Conventions 

Vectors are denoted by boldface upper-case or lower-case letters and matrices are denoted by 
boldface upper-case letters. max{ai, 02} denotes the maximum of ai and a2. diag[ai, 02, a^r] 
denotes an N x N diagonal matrix whose diagonal elements are given by ai, a2, . . . , oat. E[ ] 
denotes the expectation operator and I denotes the identity matrix. The symbol || ■ || denotes 
the l,^ norm for real vectors and spectral norm for symmetric matrices. For a matrix M, Aj(M) 
denotes the i*^ smallest eigenvalue. The vector 1 denotes an N x 1 column vector of all ones, 
1 = [1 

II. Review of Network Graph Theory 

In this paper, we model a sensor network as an undirected graph. In this section, we provide 
a brief background on network graph theory which we will use to derive our results. Consider 
an undirected graph G = (N, E) containing a set of nodes N = {1, . . . , A^} and a set of edges 
E. Nodes that communicate with each other have an edge between them. We denote the set of 
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neighbours of node i by Nj, Nj = { j\{i,j} € E} where indicates an edge between the 
nodes i and j [19]. A graph is connected if there exists at least one path between every pair of 
nodes. We denote the number of neighbours of a node i by di and dmax = maxj di. The graph 
structure is described by an x symmetric matrix called the adjacency matrix A = {aij}, 
Qij = 1 if {i, j} e E. The diagonal matrix D — diag[cii, d2, . . . , d]^] captures the degrees of 
all the nodes in the network. The Laplacian matrix of the graph is given by L = D — A. 

The graph Laplacian characterises a number of useful properties of the graph. The eigenvalues 
of L are non-negative and the number of zero eigenvalues denotes the number of distinct 
components of the graph. When the graph is connected, Ai(L) = 0, and Ai(L) > 0,i > 2, 
so that the rank of L for a connected graph is A^ — 1. The vector 1 is the eigenvector of L 
associated with the eigenvalue 0, i.e, LI = 0. The eigenvalue A2(L) characterizes how densely 
the graph is connected and the performance of consensus algorithms depend on this eigenvalue 
[20]. 

in. System Model and Previous Work 

A. System Model 

Consider a WSN with A^ sensor nodes each with an initial measurement Xi{0) e M. Measure- 
ments made at the sensor nodes are modeled as 

Xi{0) = e + ni, t = l,...,N (1) 

where ^ is an unknown real-valued parameter and is the sensing noise at the i^^ sensor. The 
sample mean of these initial measurements in (1) is given by 

1 ^ 

^=]7E^^(0)- (2) 
1=1 

Let X be the estimate of the parameter 9 to be computed by an iterative distributed algorithm, in 
which each sensor communicates only with its neighbours. If the states of all the sensor nodes 
converge to x, then the network is said to have reached consensus on the sample average. 

B. Previous Work 

A commonly used iterative algorithm for distributed consensus can be written as 
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Xi{t + 1) = Xi{t) - h{xi{t) - Xij{t)) , (3) 

where i — 1, . . . , AT, i = 0, 1, 2, . . ., is the time index, Xi{t + 1) is the updated state value of 
sensor node i at time t + 1, is the set of neighbours of sensor node i, Xij{t),j e are the 
state values of the neighbours of sensor node i at time t, and a is a constant step size. If h{-) is 
linear, then (3) is a linear distributed average-consensus (LDAC) algorithm [2], [5], [20]. In [2], 
it is proved that if < a < 2/AAr(L), then Xi{t) converges to x exponentially and (3) is then 
called as the LDAC algorithm based on the Laplacian heuristic. If h{-) is non-linear then the 
algorithm belongs to the class of non-linear distributed average-consensus algorithms [4], [21]. 
In [4], the average consensus problem is solved when h{x) in (3) is differentiable and odd. In 
[21], it is illustrated that when h{x) in (3) is sin(a;), faster convergence is possible compared 
to the LDAC algorithm based on the Laplacian heuristic. In all of these cases, Xij{t) has to be 
transmitted to node i before it can apply the function h{-) to get the new updated state value. 
Therefore, the transmit peak power in (3) is determined by Xi{t) and not necessarily bounded, 
even if h{-) is bounded. Moreover, there is no communication noise assumed in all the previous 
work on non-linear consensus. 

IV. Consensus with Bounded Transmissions and Communication Noise 

In this work, we propose a distributed non-linear average consensus algorithm in which every 
sensor maps its state value through a bounded function before transmission to constrain the 
transmit power. Therefore the magnitude of the transmitted signal at every node in every iteration 
is always bounded making it ideal for resource-constrained WSNs. 

In this section, we will study the NLC algorithm with communication noise when sensors 
exchange information. Our approach is similar to, but more general than [8] in that we analyse 
non-linear transmissions. Moreover, unlike [8] we study the asymptotic covariance matrix of the 
state vector and its dependence on the non-linearity. Unlike [21] and [4], we assume transmit 
non-linearity which allows for bounded transmissions. Moreover, we consider the presence of 
communication noise. 
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A. The NLC Algorithm with Communication Noise 

Let each sensor map its state value at time t through the function h{x) before transmission, 
and consider the following NLC algorithm with communication noise: 

Xi{t + 1) - Xi{t) - ait) [h{x^{t)) - h{xij{t)) + n^j{t)] , (4) 

where i = 1, . . . , A^, i = 0, 1, 2, . . ., is the time index. The value Xi{t + 1) is the state update of 
node i at time t+1, Xij{t) is the state value of the j*'* neighbour of node i at time t and a{t) is 
a positive step size which will further be assumed to satisfy assumption (A4) in the sequel. The 
node j transmits its information Xij{t) by mapping it through the function h{x), node i receives 
a noisy version of h{xij{t)) and nij{t) is the noise associated with the reception of h{xij{t)). 

Note that the proposed scheme (4) is different from (3) in the following aspects. Firstly, in (3), 
Xij{t) has to be transmitted which could exhibit variation over a wide range of values if Xi{0) 
has a large dynamic range and hence (3) does not guarantee bounded transmission power. In 
contrast, in the proposed scheme the non-linearity is applied before the state value is transmitted 
so that the magnitude of the transmitted state value is always constrained within the maximum 
value of h{x) irrespective of the range of Xi{t) and the realizations of noise nij{t). Finally, (4) 
involves communication noise while (3) does not. Thus the proposed scheme is more suited to 
resource constrained WSNs when compared to (3). 

The recursion in (4) can be written in vector form as 

X{t + 1) = X{t) - a{t) [Lh(X(i)) + n{t)] , (5) 

where X(t) G is the state vector at time t given by X(t) = [xi(t) X2{t) . . . xj^(t)]^, and 
h : ^ such that h(X(t)) = [h{xi{t)) h{x2{t)) . . . h{xN{t))f. The vector n{t) captures 
the additive noise at N nodes contributed by their respective neighbours and its i*'* component 
is given by 

Our model in (5) is more general than the linear consensus algorithm considered in [8] which 
is a special case of h(x) when it is linear. We make the following assumptions on h{x), nij{t), 
a{t) and the graph: 
Assumptions 
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(Al): The graph G is connected so that A2(L) > 0. 

(A2): The function h{-) is differentiable, and has a bounded derivative such that < h'{x) < c, 
for some c > 0. 

(A3) Independent Noise Sequence: The channel noise {nij{t)}t>o,i<i,j<N is an independent 
sequence across time and space. It also satisfies 

E[nij{t)] = ,1 <i,j < N,t>0 , sup E[nl{t)] < < oo. (7) 

i,j,t 

From (6) we have 

E[n(t)] =0,yt, II :^ supE[||n(t)|n < Nd.^^a'' < oo. (8) 

t 

Note that (8) is because of the fact that the number of neighbours of a given node is upper 
bounded by dmax- 

(A4) Decreasing Weight Sequence: The channel noise in (5) could make the algorithm diverge. 
In order to control the variance growth rate of the noise we need the following conditions on 
the sequence a{t): 

oo oo 

a{t) > , ^ a{t) = oo , 2^ a^{t) < oo . (9) 

t=0 t=0 

Our primary motivation for considering non-linear transmissions is to impose the realistic 
assumption of bounded peak per-sensor power by ensuring that h{-) is bounded. However, as 
seen in (A2) this assumption is not needed for our subsequent development as long as h'{-) is 
bounded. 

We will prove convergence and asymptotic normality result of the NLC algorithm in (5). For 
the sake of clarity, we now present a result on the convergence of a discrete time Markov process 
which will be used in establishing convergence of the NLC algorithm in (5). 

B. A Result on the Convergence of Discrete time Markov Processes 

Let X = {X(t)}t>o be a discrete time vector Markov process on M^. The generating operator 
£ of A" is defined as 

jCV{x) = E [V{X{t + 1)) \X{t) = x] - y(x) (10) 

for functions y(x),x G M^, provided that the conditional expectation exists. Let C C and 
its complement be C' = \ C. We now state the desired result as a simplification of Theorem 
2.7.1 in [22] (see also Theorem 1 in [8]). 
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Theorem 1. Let X be a discrete time vector Markov process with the generator operator C as 
in (10). If there exists a potential function V(x) : — > and C C with the following 
properties 



y(x) > 0,x e C', T/(x) = 0, X e C , 
>Cl^(x) < -7(t)<^(x) + mg{t)[l + V^s)] 
where m > 0, 92 (x) is such that 

</7(x) = 0, X e C, (/7(x) > 0, X e c' , 

and 

00 00 
7(i) > 0,^(i) > 0, J]7(i) = 00, < 00 , 



(11) 
(12) 

(13) 
(14) 



t=o 



t=0 



discrete time vector Markov process X — {X(i)}t>o with arbitrary initial distribution 
converges almost surely (a.s.) to the set C as t ^ 00. That is, 



Pr 



lim inf llXft) - Yll = 

t->ooYeC 



1. 



(15) 



Intuitively, Theorem 1 indicates that if the one-step prediction error of the Markov process 
evaluated at the potential function in (10) is bounded as in (12) then it is possible to establish 
convergence of X(t). 

To prove the a.s. convergence of the consensus algorithm in (5) using Theorem 1, we define 
the consensus subspace C, the set of all vectors whose entries are of equal value as. 



C = {x e M^|x= al ,ae 
We are now ready to state the main result of Section IV. 



(16) 



Theorem 2. Let the assumptions (Al), (A3) and (A4) hold, and assume h{x) is strictly increasing. 

Consider the NLC algorithm in (5) with the initial state vector X(0) G M^. Then, the state vector 
X(t) in (5) approaches the consensus subspace C a.s., i.e., 



Pr 



lim inf ||X(i) - Yll = 

t-i-ooYeC 



= 1. 



(17) 
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Proof: We will make use of Theorem 1 to prove (17). We will choose an appropriate 
potential function ^(x) that is non-negative which satisfies equation (11). We will then prove 
that the generating operator C applied on ^(x) as in (10) can be upper bounded as in (12) with 
7(t) = a{t), and a 92(x) can be found that satisfies (13). 

First we see that under the assumptions (Al), (A2) and the assumption on h{x), the discrete 
time vector process {X(i)}t>o is Markov. Since L is a positive semi-definite matrix, it has 
an eigenvalue decomposition (EVD) given by L = USU^, where S is the diagonal matrix 
containing the eigenvalues of L in the increasing order, and U is a unitary matrix with 1 as 
its first column vector which corresponds to the eigenvalue. Define a positive semi-definite 
matrix M as a function of U such that M = UAU^ and A = diag[0, 1, 1 , . . . , 1]. Let 
V{yi) = x^Mx, then the function ^(x) is non-negative since M is a positive semi-definite 
matrix by construction. Note that x e C is an eigenvector of M associated with the zero 
eigenvalue, therefore we have 

l^(x)=0,xeC. (18) 

Let X = xc + xc_L where xc is the orthogonal projection of x on C. When x e C , we have 

||xc±|| > 0. Let X G C' and h(x) be as defined in (5). Then, h(x) = hc(x) + hc±(x), where 
hc±(x) is non-zero, i.e., ||hc±(x)|| > 0. Define (5 := ||hc±(x)p/||xc±p, then /3 > 0, x e C'. 
Therefore, for any x e C', 

y(x) = x^Mx = V(xc + xc±) = V{^c±) > min xJ^Mxc± = A2(M)||xc±||' > , (19) 

where the last inequality is due to A2(L) > by assumption (Al). The equations (18) and (19) 
establish that the conditions in (11) in Theorem 1 are satisfied. 

Now we will prove that (12) is satisfied as well. Towards this end, consider CV{x) defined 
in (10), 

jCV{x) = E \x{t + lfMX{t + l)\X{t) = xl - T/(x) , (20) 
= E [(x^ - a{t) (h(x)^L^ + n{tf)) ■ (Mx - a{t) (MLh(x) + Mn(t)))] 

- V{k) , (21) 
= -2a{t) [x^MLh(x)] + a\t) [h(x)^L^MLh(x) + E [n(t)^Mn(t)]] . (22) 
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We get (22) by expanding (21) and taking the expectations and using the fact that E[n(i)] = 0. 
Recall the EVDs of L and M from which we have 



LM = ML = USU^UAU^ = USU^ = L . 
Since A2(M) = Xn{M) ^ 1, we have 

E [n(i)TMn(i)] < E [A^(M)||n(i)||2] < /x, 



(23) 



(24) 



where the second inequality follows from (8) and the fact that AAr(M) = 1. Using (23) and (24) 
in (22), we get the following bound 



CV{^) < -2a{t) [x'^Lh(x)] + a\t) + //] , 

< -2a{t) [xTLh(x)] [A^(L)/3||xcx||' + /^] 

< -2a{t) [xTLh(x)] + a\t) 



/sM^x^Mx + z.' 



A2(M) 

< -2a{t) [x'^Lh(x)] + ma^it) [l + ^ax'^Mx] , 

< -a(t)<^(x) + ma^it) [1 + V(x)] , 



(25) 
(26) 

(27) 

(28) 
(29) 



where (/^(x) := 2x'^Lh(x), m := max{^A^(L)/A2(M), //}, ^2 := A^M and ^2 e (0, 1]. In (26), 
we have used the fact h(x)TL2h(x) < A^(L)||hc±(x)||2 and ||hc±(x)|p = /5||xc±|p. In (27), 
we have used the fact that x'^Mx > A2(M)||xc±P due to (19). We will now prove that (/^(x) 
in (29) satisfies equation (13) of Theorem 1. 

Recall that L is the Laplacian matrix of the graph and that 1 is in its null space, that is, 
LI = 0. Whenever x e C, i.e., x = al, a e R, then h(x) = 61 for some 6 e M. This implies 
Lh(al) = L61 = 0. Therefore we have (/^(x) = 2x'^Lh(x) = 0, Vx e C. 

To prove (/^(x) > when x e C', consider (/?(x) for a connected graph with L of dimension 
N xN, 



(/?(x) = 2x^Lh(x) 
= 2 



(30) 
(31) 



- Xj)h{xi) + ^ (a;2 - Xj)h{x2) + . . . + ^ {xn - Xj)h{xN) 

where (31) follows from the structure of the symmetric matrix L (recall L = D — A). Note that 
the i^^ summation in (31) corresponds to the i^^ node. Now suppose that node i is connected 
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to node j. Then there exists a tenn (xi — Xj)h{xi) in the summation corresponding to the i^^ 
node in (31), and a term (xj — Xi)h(xj) in the summation corresponding to the j*^ node in (31). 
Both of these terms can be combined j){h{xi) — h{xj)) and this corresponds to the 

edge {i,j} G E. Thus equation (31) can be written as pairwise products enumerated over all the 
edges in the graph as follows 

<^(x) = 2 ^ (xi- Xj){h{xi) - h{xj)) . (32) 
{ij}eE 

Since x G C', v'(x) in (32) is positive due to the fact that h{x) is strictly increasing so that there is 
at least one term in the sum which is strictly greater than zero. Letting 7(t) = a{t),g{t) = a'^{t) 
and by assumption (A4), we see that the sequence a{t) in (29) satisfies (14). Thus all the 
conditions of Theorem 1 are satisfied to yield (17). ■ 
Theorem 2 states that the sample paths of X(i) approach the consensus subspace almost 
surely. We note that the assumption (A2) is not necessary for Theorem 2 to hold. Instead we 
assumed h(x) is strictly increasing (not necessarily differentiable) to prove Theorem 2. Now, 
like in [8], we will prove the convergence of X(t) to a finite point in C in Theorem 3. 

Theorem 3. Let the assumptions of Theorem 2 hold. Consider the NLC algorithm in (5) with 
the initial state X(0) G M^. Then, there exists a finite real random variable 9* such that 



Pr 



lim X(i) = e*l 

.t—>oo 

T 



1. (33) 



Proof: Let the average of X(t) be x{t) = 1 X(t)/A^. Since lx{t) G C, Theorem 2 implies. 



Pr 



lim \\X{t)-x{t)l\\ = 



= 1 , (34) 



where (34) follows from (17) since the infimum in (17) is achieved by Y = x{t)l. Pre- 
multiplying (5) by on both sides and noting that l^Lh(X(t)) = we get. 



x{t + l) = x{t) -v{t) (35) 
= x{0) - J2 ^(^) (36) 



0<k<t 
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where v{t) = a{t)l'^n{t)/N. From (A3), it follows that 

E[v{t)] = 0, 

E mt)? = E ^nn{t) ir < |, E < - 

which implies 

E[x{t + 1)]2 < x'{0) + ^ '^^ ■ (37) 

t>o 

Equation (37) implies that the sequence {x{t)}t>o is an £,2 bounded martingale ^ and hence 
converges a.s. and in £2 to a finite random variable 9* (see [22, Theorem 2.6.1]). Therefore the 
theorem follows from (34). ■ 
It should be noted that the results in Theorems 2 and 3 are similar to the results in [8], but 
we have proved it for a more general case of which [8] is a special case when h(x) = x. In 
what follows, we present the properties of the limiting random variable 9*. 

C. Mean Square Error of NLC Algorithm 

The Theorems 2 and 3 establish that the sensors reach consensus asymptotically and converge 
a.s. to a finite random variable 9*. We can view 9* as an estimate of x. In the following theorem 
we characterize the unbiasedness and means squared error (MSE) properties of 9*. We define 
the MSE of 9* as = E[(^* - xf]. 

Theorem 4. Let 9* be the limiting random variable as in Theorem 3. Then 9* is unbiased, 

E[9*] = X, and its MSE is bounded, < iiN~^ ^ a'^{t). 

t>o 

The proof is obtained by following the same steps of the Lemma 5 in [8]. 

We point out that with non-linear transmissions, we have obtained the same bound on the 
MSE as that of the linear consensus algorithm in [8]. It should be noted that // < Ndma^cr^ 
from (8) which implies that < rfmax^^I^t>o<^^(^)o"^- Therefore, if rfmax is finite for a 
large connected network, we have Um^v-j-ooCAr = ^iid this means that 9* converges to x as the 

'a sequence of random variables {y(i)}t>o is called as a martingale if for all t > Q, E[|j/(t)|] < oo and 
E [y{t + 1) I 2/(1) 2/(2) . . . y{t)] — y(t). The sequence {y{t)}t>o is an £2 bounded martingale if sup^ E [2/^(t)] < 00 (see 
[23, pp. 110]). 
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variance of 6* approaches 0. If the graph is densely connected, then dmax is relatively high which 
increases the worst-case MSE. On the other hand, when the graph is densely connected, A2(L) is 
larger which aids in the speed of convergence to 9*, as quantified through the covariance matrix 
in Section IV-D. 

For any connected graph with N nodes, if cr^ = then limt_>.oo ^{t) — xl, which means all 
the sensor states asymptotically converge to the desired sample average. In fact, in the absence of 
communication noise, under assumptions (Al) and (A2), we believe that it is possible to prove 
exponential convergence of X(i) to xl by letting a{t) — a such that < a < 2/(cAjv(L)) and 
by following a similar approach as in [21]. 

Similar results as in Theorems 2 and 3 could be easily proved under more general assumptions. 
For example, the graph can be randomly varying over time due to link failures. As long as the 
graph is connected on an average, it can be easily proved that the Theorems 2 and 3 hold. The 
independent assumption on the noise sequence can also relaxed and the noise sequence can be 
allowed to depend on X(i). For detailed discussions on these assumptions and its variations, 
please see Section III-A in [8]. We do not pursue these extensions herein since our focus is on 
studying the effect of non-linear transmissions on performance. 

D. Asymptotic Normality of NLC Algorithm 

The NLC algorithm in (5) belongs to the class of stochastic approximation algorithms. The 
convergence speed of these algorithms is an important issue from a practical perspective. There 
are various criteria for determining the rate of convergence. For instance, one can try to estimate 
E [||X(t) - 6'*lf ] or Pr [||X(t) - 6'*1|| < e(t)] [24]. Estimating these parameters may be difficult 
in practice. However, it is usually possible to establish that v^(X(t) — 9*1) is asymptotically 
normal with zero mean and some covariance matrix. Asymptotic normality of stochastic ap- 
proximation algorithms have been established under some general conditions in [22] and for the 
linear consensus algorithms in [6]. 

In this section, we establish the asymptotic normality of the NLC algorithm in (5). Our 
approach here is similar to the one in [6]. Basically, we decompose the NLC algorithm in 
into a scalar recursion and a recursion in M^^^^). In this section, for the sake of simplicity we 
assume that the noise sequence {n{t),t > 0} are i.i.d. random vectors with zero mean and finite 
covariance. We now formally state and prove the result as a theorem. 
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Theorem 5. Let a{t) — a/t,a > 0, then the NLC algorithm in (5) becomes 



X{t + 1) = X(t) + J [-Lh{X{t)) + n{t)] . 

V 



(38) 



Suppose that the assumptions (Al), (A2), (A3) and (A4) hold and that the noise sequence 
{n(t),t > 0} are i.i.d. across time and space with zero mean and covariance a^l. Let the EVD 
of L be given by L = USU^, where U is a unitary matrix whose columns are the eigenvectors 
o/L such that 

1 



U 



N 





B 



(39) 



where B e R(^-i)x(^-i) 

is a diagonal matrix containing the N — 1 negative eigenvalues of 
— L (this means that B is a stable matrix). In addition, let 6q be a realization of the random 
variable 6* and 2a\2(L)h' {9o) > 1 so that the matrix [a/i'(6'o)B + 1/2] ,9q & M. is stable. 
Define [n{t) h{t)^]^ N-y^\]^n{t), h{t) e M^^"^), so that n{t) = N-^l^n{t) and h{t) = 
N-^/^^'^n{t). LetC^ E[hn^], C e R(n-^)x(n-^), Then, as t ^ oo. 



Vt{X{t) - OqI) ~ (0, AT-^aV^ll^ + Ar-i#S^°^^) , (40) 

where 

oo 

S^o ^ ^2 yja/.' (eo)B+i]t ^ ^[ah'(e,)B+L]t^^ ^^^^ 


Proof: Define [x{t) X{tf]"^ := N-^/^V^X{t),X{t) e R^^-^). From Theorem 3, we have 
X(i) ^ 9*1 a.s. as t ^ oo which impUes that [x{t) X{t)\^ [9* a.s. as t ^ oo, and 
therefore X{t) a.s. as i — > oo. The error [X{t) — 9ol\ can be written as the sum of two 
error components (see also Section VI in [6]) as given below 

[X{t) - 9ol] = [x{t) -9o]l + 4^^Mt) , (42) 

= ei + 62 , (43) 

where ei = [x{t) — 9o]l and 62 = N~^^'^^X{t). By calculating the covariance matrix between 
ei and 62, it can be proved that they are asymptotically uncorrelated as t — )> 00, and that 
asymptotically Viei ~ J\f{0, N'^a'^a'^ll'^) (see Theorem 12 in [6]). To show that Vte2 is 
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asymptotically normal, it suffices to show that v^X(i) is asymptotically normal. To this end, 
express h{x) in (38) around x — 9q using Taylor's series expansion, 

h{x) = h{eo) + K{eQ){x - eo) + o{\x- 0o\) , as x ^ ■ (44) 

Using (44) in (38) we get 

X{t + 1) = Xit) + J [-L (h{eo)l + /i>o)[X(i) - ^ol]) + <5(X(i)) + n(i)] , (45) 

, as i — >■ oo , (46) 



a 



= X{t) + - h {60) (-LX(i)) + S{X{t)) + n{t) 

where ||(5(X(t))|| — )■ as i ^ 00. Pre-multiplying (46) on both sides by A'"~^/^U'^ and using 
(39) we get the following recursions 

x{t + 1) = x{t) + ^n{t) , (47) 

V 



X{t + 1) = X{t) + ^ h'{eo)BX{t) + d{X{t)) + n(t) 



, as t ^ 00 , (48) 



t 

where 5(X(t)) = N-^/^^'^5{X{t)). With the assumption that [ah' {9o)B + 1/2] ,60 e R is a 
stable matrix, it can be verified that all the conditions of Theorem 6.6.1 in [22, p. 147] are satisfied 
for the process X(t) in (48). Therefore, for a given 9q, the process y/tX{t) is asymptotically 
normal with zero mean and covariance matrix given by (41). Since v^ei ~ A/'(0, N'^a'^a^ll'^) 
and using (41) together with the fact that ei and 62 are asymptotically independent as t — > 00, 
we get (40) which completes the proof. ■ 

Equation (40) indicates how fast the process X{t) will converge to 60I for a given ^o- The 
convergence speed clearly depends on h'{9o). We note that if h{x) = x, then h' (Of^) = 1,\/9q e M, 
and substituting this in (41), we get the results for the linear case as in Theorem 12 of [6]. 

Let the asymptotic covariance in (40) be denoted by Cnic- Since n(t) are i.i.d., C in (41) 
becomes C = a^I and thus we have Cnic = N~'^a'^a^ll^ +N~^^S^°^^ where is a diagonal 
matrix whose diagonal elements are given by S^? = a^a^/[2ah' {9o)Xi+i(L) — 1]. A reasonable 
quantitative measure of largeness [24] of the asymptotic covariance matrix is ||Cnic|| which is 
the maximum eigenvalue of the symmetric matrix Cnic- Further, ||Cnic|| can be minimized with 
respect to the parameter a. This can be formulated as the following optimization problem, 

min max x^'^CnicX , (49) 

{a|2a/i'(6>o)A2(L)>l} {x|xeR'V,||x||2<l} 
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which can be solved analytically by using the KKT conditions [25]. The value of a that optimizes 

(49) is a*ip = {N + 1) / [2N \2{L)h' {9o)] and the corresponding optimal value of the ||Cnic||is 
given by 

The size of the asymptotic covariance matrix in (50) is inversely proportional to the square of 
the smallest non-zero eigenvalue A2(L) which quantifies how densely a graph is connected. We 
also note that (50) is directly proportional to the channel noise variance a^. 

Equation (50) also gives some useful insights to design the transmission function h{x). If 
we choose two functions hi{x) and h2{x) such that h'i{x) > }i2{x)yx e R, it is easy to see 
from (50) that ||C*ij.^|| < ||C*jp2l|, V^o £ I^- This means that the convergence will be faster 
when hi{x) is employed in the NLC algorithm (5) than when h2{x) is employed. However, it 
should be noted that if h!^{x) > h'2{x),yx e M and suppose hi{0) = h2{0) = then we have 
hKx) > h2{x),\/x which implies that on an average the transmit power is greater when hi{x) is 
employed compared to h2{x). We will illustrate these findings in the simulations in Section V. 
Comparing the ||C*ij.|| against the special case of h{x) — cx yields ||C*ip|| = ||Ci*jj||(c//i'(^o))^- 
Clearly c/h'{9o) < 1 and therefore if h{x) is bounded, appropriately normalized by letting 
c = 1, so that < h'{x) < 1, we conclude that the best case linear algorithm outperforms the 
best case NLC algorithm in terms of speed of convergence. However, the improved asymptotic 
covariance matrix in the former is achieved at the cost of increased peak and average transmit 
power compared to the latter. 

V. Simulations 

In this section, we corroborate our analytical findings through various simulations. In all the 
simulations presented, the initial samples Xi{0) e R, i = 1,2,. . .,N, were generated randomly 
using Gaussian distribution with a standard deviation equal to 10. The desired global average 
value is indicated in each of the simulations. We focus here on bounded transmission functions 
to study their performance. Please note that our results are valid for a broader class of increasing 
functions (see Section IV-A) than the ones considered in this section. 
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A. Performance of NLC Algorithm Without Channel Noise 

Our focus in this paper is on non-linear transmissions in the presence of noise. However, we 
would also like to illustrate the convergence behavior on the absence of noise. Figures 1, 2 and 3 
depict the performance of the proposed NLC algorithm in the absence of channel noise for a large 
network with N — Ib.ln all the cases, we have used a values such that < a < 2/(cAAr(L)) 
as mentioned in Section IV-C. 

From Figure 1, we infer that in about 50 iterations, all the nodes reach consensus on the 
desired global average of x = 76. Figure 2 shows evolution of error norm ||X(t) — for 
various bounded functions. We see that the convergence is exponential in all cases as noted 
in Section IV-C. Figure 3 illustrates the performance of the NLC algorithm when a is varied. 
Interestingly, by adjusting the step size a it is indeed possible to achieve the same convergence 
speed using the NLC algorithm as that of optimal linear consensus algorithm using the Laplacian 
heuristic [2]. 

B. Performance of NLC algorithm with Channel Noise 

Figures 4-8 illustrate the performance of NLC algorithm in the presence of communication 
noise. As explained in the assumption (A4) in Section IV-A, we chose the decreasing step 
sequence to be a{t) — l/{t-\-l),t > 0, in all simulations. Here we assumed that p — maxx h'^{x) 
is the maximum power available at each sensor to transmit its state value. Figure 4 shows that 
the nodes employing the NLC algorithm reach consensus for a small network with N — 10. 
Figure 5 shows the transmit power h'^{xi{t)),i = 1,2, .... N, per-neighbour versus iterations for 
a large network. Clearly, the transmit power is always constrained within the upper bound of 
p (indicated by the dashed line) making the proposed scheme practically viable for the power 
constrained WSNs. 

In Figures 6, 7 and 8, we show the convergence speed performance of the proposed NLC 
algorithm by plotting | |E[X(i)] —xl\\ versus iterations t. These plots indicate how fast the mean 
of the process X(i) converges towards the desired global mean vector xl. 

In Theorem 5, we saw that if two functions hi{x) and h2{x) such that h[{x) > /i2(a;), G M, 
are employed in the NLC algorithm then the convergence will be faster for hi{x) compared to 
that of h2{x). This is illustrated in Figure 6 where we have chosen hi{x) = y/ptan~^(wa;) and 
h2{x) — y/pta,nh.{u!x). The performance gain of hi{x) obtained over h2{x) can be understood 
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intuitively by observing that on an average the transmit power will be more when hi{x) is 
employed than when hi{x) is employed. The speed of convergence for various transmit functions 
appropriately normalized to have the same peak power p is shown in Figure 7. Here, we see 
that the transmit function hi{x) has the best performance and h^^x) has the worst performance. 
Intuitively this is due to the fact that h[{x) > h^ix) > h'^{x) > h'^{x). Finally, we depict the 
convergence speed versus the power scaling constant p, the upper bound on the transmit power, 
in Figure 8. For a given transmit function, increased power leads to faster convergence as would 
be expected, and we also observe that when the consensus iterations were increased, speed of 
convergence improves. 

VI. Conclusions 

A distributed consensus algorithm in which every sensor maps its state value through a bounded 
function before transmission to constrain the transmit power is proposed. The transmitted signal 
power at every node in every iteration is always bounded irrespective of the state value or the 
communication noise, which is a desirable feature for low-power sensors with limited peak 
power capabilities. In the presence of communication noise, it is proved using the theory of 
Markov processes that the sensors reach consensus asymptotically on a finite random variable 
whose expectation contains the desired sample average of the initial sensor measurements, and 
whose mean-squared error is bounded. The asymptotic convergence speed of the proposed 
algorithm is characterized by deriving the asymptotic covariance matrix using results from 
stochastic approximation theory. While the proposed NLC algorithm has the desirable feature 
of bounded transmit power, it is shown that using the best case NLC algorithm results in larger 
asymptotic covariance compared to the best case linear consensus algorithm. In the absence of 
communication noise, it is illustrated that the network achieves consensus on the global sample 
average exponentially fast provided the step size is chosen appropriately and that by adjusting 
the step size, it is possible to achieve the same speed of convergence as that of the best case 
linear consensus algorithm using Laplacian heuristic. 
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Fig. 1. Entries of X(i) versus Iterations t: a = 1.5, lj = 0.01, A'^ = 75, h{x) = tanh(a;a;), x = 76. 
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Fig. 2. Evolution of error ||X(t) — xl\\ versus Iterations t: a = 1.5, uj = 0.01, N — 75, x — 76. 
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Fig. 3. Evolution of error ||X(t) — xl\\ versus Iterations t: a — 2, 4, 6,8, lj — 0.005, A'^ = 75, h{x) — ^ tan^^[sinh(-|tJ2;)], 
X = 114. 
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Fig. 4. Entries of X(t) versus Iterations t: h{x) = tanli(aja;), u = 0.05, = 10, x = 36.24, p = 10 dB, al = 1. 
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Fig. 5. Transmit power h?{xi{t)) per-neighbour versus Iterations t: h{x) = y^tanh([ja;), uj = 0.005, — 75, x — 102, 
p = 7.5 dB, = 0.1. 
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Fig. 6. ||E[X(t)] - xl\\ versus Iterations t: hi{x) = ^ptan ''{lux), h2{x) = ^tanh(a;a::), cj = 0.005, iV = 75, 
X = 162, 202, p = 7.5 dB, al = 1. 
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Fig. 7. ||E[X(t)] - xl\\ versus Iterations t: ui = 0.04, A*' = 10, x = 36.24, p = 5 dB, = 1. 
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Fig. 8. llE[X(t)] versus p: = Jp—r^ , 1^ = 0.006, = 75, x = 77, Iterations t = 20,40,60,80, al = 1. 
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